Model Selection

Common Voice adaptation

# Common Voice adaptation

Whisper Medium Catalan

This is a speech recognition model fine-tuned on the Catalan Common Voice 11.0 dataset based on OpenAI Whisper Medium.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr 53 Tr Fine Tuning Deprecated

This model is a speech recognition model fine-tuned on the Common Voice Turkish dataset based on facebook/wav2vec2-large-xlsr-53

Speech Recognition

Wav2vec2 Large Xlsr Kyrgyz

This is an automatic speech recognition model fine-tuned on the Kyrgyz Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.

Speech Recognition Other

Wav2vec2 Large Xlsr Turkish

This is an automatic speech recognition model fine-tuned on the Turkish Common Voice dataset based on the facebook/wav2vec2-large-xlsr-53 model, achieving a test WER of 21.13%.

Speech Recognition Other

Wav2vec2 Large Xlsr 53 W2V2 TATAR SMALL

This model is a Tatar automatic speech recognition model fine-tuned on the Common Voice 8 dataset based on facebook/wav2vec2-large-xlsr-53, with a test set WER of 53.16%.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr 53 Polish

XLSR-53 large model speech recognition system optimized for Polish, fine-tuned based on facebook/wav2vec2-large-xlsr-53, supports Polish automatic speech recognition

Speech Recognition Other

Wav2vec2 Large Xlsr Sorbian

A speech recognition model fine-tuned on Common Voice Upper Sorbian data based on facebook/wav2vec2-large-xlsr-53, supporting automatic speech recognition tasks for Upper Sorbian.

Speech Recognition Other

Wav2vec2 Large Xlsr Estonian

This is an Estonian automatic speech recognition (ASR) model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice dataset.

Speech Recognition Other

Wav2vec2 Large Xlsr Breton

A speech recognition model fine-tuned on the Breton Common Voice dataset based on facebook/wav2vec2-large-xlsr-53

Speech Recognition Other

Xlsr En Punctuation

Fine-tuned automatic speech recognition model based on facebook/wav2vec2-large-xlsr-53 on the English Common Voice dataset, supporting punctuation prediction

Speech Recognition English

Wav2vec2 Large Xlsr 53 Turkish

This is an automatic speech recognition model fine-tuned on the Turkish Common Voice dataset based on Facebook's wav2vec2-large-xlsr-53 model.

Speech Recognition Other

Wav2vec2 Large Xlsr Kyrgyz

A Kyrgyz speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, trained on Common Voice dataset with a word error rate of 34.08%.

Speech Recognition Other

Wav2vec2 Large Xlsr Persian

A fine-tuned automatic speech recognition model for Persian (Farsi) based on facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input.

Speech Recognition Other

Wav2vec2 Large Xlsr Or

Automatic speech recognition model fine-tuned on Odia language based on Facebook's wav2vec2-large-xlsr-53 model

Speech Recognition Other

Wav2vec2 Xls R 300m Br Small

This model is a fine-tuned version of facebook/wav2vec2-xls-r-300m on the Common Voice dataset, supporting Breton (br) speech recognition tasks.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr 53 Russian

A Russian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, supporting 16kHz sampled audio input

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Spanish

This is an automatic speech recognition (ASR) model fine-tuned on the Spanish Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.

Speech Recognition Spanish

Wav2vec2 Large Xlsr 53 Lithuanian

An automatic speech recognition model fine-tuned for Lithuanian using the Common Voice dataset, based on the facebook/wav2vec2-large-xlsr-53 model.

Speech Recognition Other

Wav2vec2 Large Xlsr Welsh

An automatic speech recognition model fine-tuned on the Welsh Common Voice dataset based on facebook/wav2vec2-large-xlsr-53, achieving a test WER of 29.4%.

Speech Recognition Other

Wav2vec2 Large Xlsr Pa IN

A speech recognition model fine-tuned on the Punjabi Common Voice dataset based on facebook/wav2vec2-large-xlsr-53

Speech Recognition

Wav2vec2 Xls R 300m As CV8 V1

Assamese (Assamese) speech recognition model fine-tuned on the Common Voice 8.0 dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Greek 2

A speech recognition model fine-tuned on the Greek Common Voice dataset based on facebook/wav2vec2-large-xlsr-53, balancing the training set with synthesized female voice data

Speech Recognition

Transformers Other

Wav2vec2 Xls R 300m Lg

This model is a fine-tuned speech recognition model based on facebook/wav2vec2-xls-r-300m on the COMMON_VOICE - LG dataset, supporting automatic speech recognition tasks for Luganda (lg).

Speech Recognition

Transformers Other

Wav2vec2 Xls R 300m Cs Cv8

A speech recognition model fine-tuned on the Common Voice 8.0 Czech dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Italian

An Italian speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53, achieving a word error rate of 13.91% on the Common Voice Italian test set

Speech Recognition Other

Wav2vec2 Large Xlsr Mongolian

This is an automatic speech recognition model fine-tuned on the Mongolian Common Voice dataset based on facebook/wav2vec2-large-xlsr-53

Speech Recognition Other

Wav2vec2 Xls R 1b Ro

This model is an automatic speech recognition model fine-tuned on the Romanian Common Voice 7.0 dataset based on facebook/wav2vec2-xls-r-1b.

Speech Recognition

Transformers Other

Wav2vec2 Large Xlsr Tamil

An automatic speech recognition model fine-tuned on the Tamil language using the Common Voice dataset, based on facebook/wav2vec2-large-xlsr-53.

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Portuguese

This is a fine-tuned XLSR-53 large model for Portuguese speech recognition tasks, trained on the Common Voice 6.1 dataset, supporting Portuguese speech-to-text conversion.

Speech Recognition Other

Wav2vec2 Xlsr Chuvash

A fine-tuned model based on facebook/wav2vec2-large-xlsr-53 for Chuvash automatic speech recognition tasks.

Speech Recognition Other

Wav2vec2 Xls R 300m Pa IN R5

This is an automatic speech recognition model fine-tuned on the Punjabi (India) dataset based on the facebook/wav2vec2-xls-r-300m model.

Speech Recognition

Wav2vec2 Xls R 300m Romanian

A Romanian speech recognition model fine-tuned based on facebook/wav2vec2-xls-r-300m, achieving a WER of 12.46% on the Common Voice Romanian test set

Speech Recognition

Wav2vec2 Large Xlsr 53 Tamil

This is an automatic speech recognition model fine-tuned on the Tamil Common Voice dataset based on facebook/wav2vec2-large-xlsr-53.

Speech Recognition Other

Wav2vec2 Large Xlsr 53 Georgian

This is a Georgian automatic speech recognition (ASR) model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, trained using the Common Voice dataset.

Speech Recognition Other

MehdiHosseiniMoghadam

Wav2vec2 Large Xlsr 53 Breton

This is a Breton automatic speech recognition model based on the XLSR Wav2Vec2 architecture, fine-tuned on the Common Voice dataset.

Speech Recognition Other

Xlsr 300m CV 8.0 50 EP New Params Nl

This is an automatic speech recognition (ASR) model based on the XLS-R architecture with 300M parameters, specifically optimized for Dutch and trained on the Common Voice 8.0 dataset.

Speech Recognition

Transformers Other

Xlsr300m Cv 7.0 Nl Lm

XLS-R-300M is an automatic speech recognition (ASR) model specifically optimized for Dutch, trained on the Common Voice 8 Dutch dataset.

Speech Recognition

Transformers Other

This is an automatic speech recognition model fine-tuned on the COMMON_VOICE - AB dataset, based on the XLS-R Dummy architecture

Speech Recognition

Transformers Other

Wav2vec2 Xls R 300m Rm Sursilv D11

This model is an automatic speech recognition model fine-tuned on the Romansh-Sursilvan dialect dataset based on facebook/wav2vec2-xls-r-300m, achieving a 24.09% Word Error Rate (WER) on the Common Voice 8 test set.

Speech Recognition

Wav2vec2 Xls R 300m Kk N2

This is an automatic speech recognition (ASR) model fine-tuned on Kazakh (KK) speech datasets based on the facebook/wav2vec2-xls-r-300m model.

Speech Recognition

Transformers Other

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase